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Smoothed analysis is a new way of analyzing algorithms introduced by Spielman 
and Teng {J. ACM, 2004). Classical methods like worst-case or average-case anal- 
ysis have accompanying complexity classes, like P and Avg-P, respectively. While 
worst-case or average-case analysis give us a means to talk about the running time 
of a particular algorithm, complexity classes allows us to talk about the inherent 
difficulty of problems. 

Smoothed analysis is a hybrid of worst-case and average-case analysis and com- 
pensates some of their drawbacks. Despite its success for the analysis of single 
algorithms and problems, there is no embedding of smoothed analysis into com- 
putational complexity, which is necessary to classify problems according to their 
intrinsic difficulty. 

We propose a framework for smoothed complexity theory, define the relevant 
classes, prove some first results, and study some examples. 

1 Introduction 

Computational complexity theory is the science of classifying computational problems accord- 
ing to their intrinsic difficulty. While the analysis of algorithms is concerned with analyzing, 
say, the running time of a particular algorithm, complexity theory rather analyses the amount 
of resources that all algorithms need at least to solve a given problem. 

Classical complexity classes, like P, reflect worst-case analysis of algorithms. A problem is 
in P if there is an algorithm whose running time on all inputs of length n is bounded by a 
polynomial in n. Worst-case analysis has been a success story: The bounds obtained are valid 
for every input of a given size, and, thus, we do not have to think about typical instances of 
our problem. If an algorithm has a good worst-case upper bound, then this is a very strong 
statement: The algorithm will perform well in practice. (For practical purposes, "good upper 
bound" of course also includes constants and lower order terms.) 

However, some algorithms - like quicksort or the simplex algorithm for linear programming 
- work well in practice, despite having a provably high worst-case running time. The reason 
for this is that the worst-case running time can be dominated by a few pathological instances 
that rarely or never occur in practice. An alternative to worst-case analysis is average-case 
analysis. Many of the algorithms with poor worst-case but good practical performance have a 



good average running time. This means that the expected running time with instances drawn 
according to some fixed probabihty distribution is low. 

In complexity-theoretic terms, P is the class of all problems that can be solved with polyno- 
mial worst-case running time. In the same way, the class Avg-P is the class of all problems that 
have polynomial average-case running time. Average-case complexity theory studies the struc- 
tural properties of average-case running time. Bogdanov and Trevisan give a comprehensive 
survey of average-case complexity [9]. 

While worst-case complexity has the drawback of being often pessimistic, the drawback of 
average-case analysis is that random instances have often very special properties with high 
probability. These properties of random instances distinguish them from typical instances. 
Since a random and a typical instance is not the same, a good average-case running time 
does not necessarily explain a good performance in practice. In order to get a more realistic 
performance measure (and, in particular, to explain the speed of the simplex method), Spielman 
and Teng have proposed a new way to analyze algorithms called smoothed analysis [27]. In 
smoothed analysis, an adversary chooses an instance, and then this instance is subjected to 
a slight random perturbation. We can think of this perturbation as modeling measurement 
errors or random noise or the randomness introduced by taking, say, a random poll. The 
perturbation is controlled by some parameter (p, called the perturbation parameter. Spielman 
and Teng have proved that the simplex method has a running time that is polynomial in 
the size of the instance and the perturbation parameter [27]. (More precisely, for any given 
instance, the expected running time on the perturbed instance is bounded by a polynomial.) 
Since then, the framework of smoothed analysis has been applied successfully to a variety of 
algorithms that have a good behavior in practice (and are therefore widely used) but whose 
worst-case running time indicates poor performance [2-4,7,8,10-13,21,24-26,29]. We refer to 
two recent surveys for a broader picture of smoothed analysis [20,28]. 

However, with only few exceptions [5,23], smoothed analysis has only been applied yet to 
single algorithms or single problems. Up to our knowledge, there is currently no attempt to 
formulate a smoothed complexity theory and, thus, to embed smoothed analysis into compu- 
tational complexity. 

This paper is an attempt to define a smoothed complexity theory, including notions of 
intractability, reducibility, and completeness. We define the class Smoothed-P (Section 2), 
which corresponds to problems that can be solved smoothed efficiently, we provide a notion 
of reducibility (Section 3), and define the class Dist-NPpara; which is a smoothed analogue of 
NP, and prove that it contains complete problems (Section 4). We continue with some basic 
observations (Section 5). We also add examples of problems in Smoothed-P (Sections 6 and 7). 
Finally, since this is an attempt of a smoothed complexity theory, we conclude with a discussion 
of extension, shortcomings, and difficulties of our definitions (Section 8). 

2 Smoothed Polynomial Time and Smoothed-P 

2.1 Basic Definitions 

In the first application of smoothed analysis to the simplex method [27], the strength of the 
perturbation has been controlled in terms of the standard deviation a of the Gaussian pertur- 
bation. While this makes sense for numerical problems, this model cannot be used for general 
(discrete problems). A more general form of perturbation models has been introduced by Beier 
and Vocking [4]: Instead of specifying an instance that is afterwards perturbed (which can also 
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be viewed as the adversary specifying the mean of the probabihty distribution according to 
which the instances arc drawn), the adversary specifics the wliolc probabihty distribution. Now 
the role of the standard deviation a is taken over by the parameter cj), which is an upper bound 
for the maximum density of the probabihty distributions. For Gaussian perturbation, we have 
a = 6(1/^). Because we do not want to restrict our theory to numerical problems, we have 
decided to use the latter model. 

Let us now define our model formally. Our perturbation models are families of distributions 
V = {Dn,x,<j))- The length of x is n (so we could omit the index n but we keep it for clarity). 
Note that length does not necessarily mean bit length, but depends on the problem. For 
instance, it can be the number of vertices of the graph encoded by x. 

For every n, x, and cp, the support of the distribution Dn^x,<j) should be contained in the set 
{0, l}<P°iy("). Let Sn,x = {y I Dn,xAy) > for some 0}, and let 

For all n, x, cf), and y, we demand Dn^x,<f){y) ^ This controls the strength of the per- 
turbation and restricts the adversary. We allow (j) € [l/A^^^j:, 1]. Furthermore, the values of 
(p are discretized, so that they can be described by at most poly(n) bits. = 1 corresponds 
to the worst-case complexity; we can put all the mass on one string. ^ = 1/Nn,x models the 
average case; here we usually have to put probability on an exponentially large set of strings. 
In general, the larger cp, the more powerful the adversary. We call such families {Dn^x,(j))n,x,<t> 
of probability distributions parameterized families of distributions. 

Now we can specify what it means that an algorithm has smoothed polynomial running-time. 
The following definition can also be viewed as a discretized version of Beier and Vocking's 
definition [5]. Note that we do not speak about expected running-time, but over expected 
running-time to some power e. This is because the notion of expected running-time is not 
robust with respect to, e.g., quadratic slowdown. The corresponding definition for average- 
case complexity is due to Levin [17]. We refer to Bogdanov and Trevisan [9] for a thorough 
discussion of this issue. 

Definition 2.1. An algorithm A has smoothed polynomial running time with respect to the 
family V if there exists an £ > such that, for all n, x, and (p, we have lEy~D„ ^ ^ (^^(y; n, 0)^) = 
0{n-Nn,x-4>)- 

This definition implies that (average-)polynomial time is only required if we have (j) = 
0(poly(ra)/A^n,a:)- This seems to be quite generous at first glance, but it is in accordance 
with, e.g., Spielman and Teng's analysis of the simplex method [27] or Beier and Vocking's 
analysis of integer programs [5] ; they achieve polynomial time running time only if they perturb 
all but at most O(logn) digits. 

In average-case complexity, one considers not decision problems alone, but decision problems 
together with a probability distribution. The smoothed analogue of this is that we consider 
tuples (L,P), where L C {0,1}* is a decision problem and D is a parameterized family of 
distributions. We call such problems parameterized distributional problems. 

As some examples for distributional problems, we will study the bounded halting problem, 
binary integer optimization, and graph problems under the smoothed extension of Gn,p. 

The notion of smoothed polynomial running-time (Definition 2.1) allows us to define what 
it means for a parameterized distributional problem to have polynomial smoothed complexity. 

Definition 2.2. Smoothed-P is the class of all (L,!)) such that there is a deterministic algo- 
rithm A with smoothed polynomial running time that decides L. 
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We start with an alternative characterization of smoothed polynomial time, like it is known 
for average case running time. It basically says that an algorithm has smoothed polynomial 
running-time if and only if its running-time has polynomially decreasing tail bounds. Though 
smoothed polynomial time is a generalization of average case polynomial time, the characteri- 
zation and the proof of equivalence is similar. 

Theorem 2.3. An algorithm A has smoothed polynomial running time if and only if there is 
an £ > and a polynomial p such that for all n, x, (j), and t, 

Pr [tA{y\n,4>)>t\<^-Nr,,^-4>. 

Proof. Let A be an algorithm whose running time tA fulfills 

{tA{y\ n, (pf) =0{n- Nn,x4') ■ 

The probability that the running time exceeds a certain value t can be bounded by Markov's 
inequality: 

ntA{y; n, 4>)>t) = ^{tA{y; n, 4>Y > t') < '^y-''r.,.,,My;n,4>r) ^^^^ ^^^^^ 

For the other direction, assume that 

Pr [tA{y;n,(l))>t]< — -Nn,x(l> 
for some constants c and e. Let e' = e/(c + 2). Then we have 

^y-D„,.A^A{y;n,4>r') = Y,ntA{y;n,<t>f' > t) 

t 

<n + ^F{tA{y;n,(l))>t^/'') 

t>n 

< n + ^ . jY^^^^ ^ „ + 0{Nn,x(f>). 

t>n 

□ 

2.2 Heuristic Schemes 

A different way to think about efficiency in the smoothed setting is via so-called heuristic 
schemes. This notion comes from average-case complexity [9], but can be adapted to our 
smoothed setting. The notion of a heuristic scheme comes from the observation that, in 
practice, we might only be able to run our algorithm for a polynomial number of steps. If the 
algorithms does not succeed within this time bound, then it "fails", i.e., it does not solve the 
given instance. The failure probability decreases polynomially with the running time that we 
allow. The following definition captures this. 

Definition 2.4. Let {L,V) be a smoothed distributional problem. An algorithm A is an error- 
less heuristic scheme for {L, V) if there is a polynomial q such that 
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1. For every n, every x, every (p, every S > 0, and every y G supp D„^x,^, we have 
A{y\ n, 4>, 6) outputs either L{y) or ±. 

2. For every n, every x, every (f), every S > 0, and every y G swpp Dn,x,4>) we have 
tA{y;n,6) <q{n,Nn,x<t>A/5). 

3. For every n, x, (f), S > 0, and y G suppD^,^,,^, we have FTy^Dn^^^^[A{y;n,(f),d) = ±]<S. 

With the definition of a heuristic scheme, we can prove that heuristic schemes precisely 
characterize Smoothed- P. 

Theorem 2.5. {L,T>) G Smoothed-P if and only if {L,T>) has an errorless heuristic scheme. 
Proof. Let A be an algorithm for (L,T)). By Theorem 2.3, the probability that 

¥{tA{y; n,4>)>t)=0{n- Nn,x4> ■ t-') . 

We get an errorless heuristic scheme B from A as follows: Simulate A for {n- Nn,x4>/^)^^^ steps. 
If A stops within these number of steps, then output whatever A outputs. Otherwise, output 
_L. By the choice of the parameters, the probability that B outputs _L is bounded by 6. 

For the other direction, let A be an errorless heuristic scheme for (L, V). We get an algorithm 
with smoothed polynomial running time by first running A with d = 1/2, then with S = 1/4, 
and in the ith iteration with S = 1/2*. Whenever A does not answer _L, B gives the same 
answer and stops. B will eventually stop, when 6 < Dn^x,(t>{y)- For i iterations, B needs 

i 

J2 Qin, Nn,x^, 2^) < poly(n, Nn,x<l>) ■ 2" 

j=i 

for some constant c. B stops after i iterations for all but a 2~* fraction of the input. Thus B 
has smoothed polynomial running time. (Choose e < 1/c.) □ 

2.3 Alternative Definition: Bounded Moments 

At first glance, one might be tempted to use "expected running time" for the definition of Avg-P 
and Smoothed-P. However, as mentioned above, simply using the expected running time does 
not yield a robust measure. This is the reason why the expected value of the running time 
raised to some (small) constant power is used. 

However, Roglin and Teng [22, Theorem 6.2] proved that for integer programming (more 
precisely, for binary integer programs with a linear objective function), the expected value 
indeed provides a robust measure. More precisely, they proved that a binary optimization 
problem can be solved in expected polynomial time if and only if it can be solved in worst-case 
pseudo-polynomial time. The reason for this is that all finite moments of the Pareto curve are 
polynomially bounded. Thus, a polynomial slow down does not cause the expected running 
time to jump from polynomial to exponential. 

As far as we are aware, this phenomenon, i.e., the case that all finite moments have to be 
bounded by a polynomial, has not been studied yet in average-case complexity. Thus, for 
completeness, we define the corresponding average-case and smoothed complexity classes as 
an alternative to Avg-P and Smoothed-P. 
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Definition 2.6. 1. An algorithm has robust smoothed polynomial running time with re- 
spect to V if, for all fixed e > and for every n, x, and (f), we have 



Smoothed-PBM is the class of all (L,!)) for which there exists a deterministic algorithm 
with robust smoothed polynomial running time. (The "PBM" stands for "polynomially 
bounded moments".) 

2. An algorithm A has robust average polynomial running time with respect to T> if, for all 
fixed £ > and for all n, we have E^^/j^ (tyi(y)^) = 0(n). Avg-PBM contains all {L,V) 
for which there exists a deterministic algorithm with robust smoothed polynomial running 
time. 

From the definition, we immediately get Smoothed-PBM C Smoothed-P and Avg-PBM C 
Avg-P. Moreover, if L € P, then L together with any family of distributions is also in 
Smoothed-P and Avg-P and, thus, also in Smoothed-PBM and Avg-PBM. From Roglin and 
Teng's result [22], one might suspect that Avg-P = Avg-PBM as well as Smoothed-P = 
Smoothed-PBM. However, this does not hold. 

Theorem 2.7. Avg-PBM C Avg-P and Smoothed-PBM C Smoothed-P. 

Proof. Wc only prove the theorem for average-case complexity. The proof for the smoothed 
complexity case is almost identical. 

By the time hierarchy theorem [1], there is a language L' G DTime(2") such that L' ^ 
DTime(2"(")). Consider the following language L = {xO" | \x\ = n,x & L'}. Let V = (D^) 
be a hard probability distribution for L', i.e., (L',P') is as hard to solve as L' in the worst 
case [18, 19] 

Let V = (Dn) be given as follows: 



Dn{xy) 



2 " • -D(i(x) if \x\ = n and j/ = 0" and 
2~^"^ otherwise. 



Since L' G DTime(2"), we have {L,T>) G Avg-P: L can be decided in expected time time 
2-n . 2" + o{n) = 0{n). Now we prove that {L,V) ^ Avg-PBM. If {L,V) G Avg-PBM were 
true, then 2"" • IEa;~D^(t(a;)'^) would be bounded by a polynomial for all fixed c. Here, t is the 
time needed to solve the L' instance x. 

Our choice of D' , Jensen's inequality, and the fact that L' ^ DTime(2''(")) implies that 
Ej,^D'Jt{xy) > E^^D'Jt{x)y = 2'=-"("). Thus, for some sufficiently large c, 2"" • Ea,^D;(i(x)'=) 
exceeds any polynomial. □ 

Question 2.8. Does there exist a language L G NP together with some ensemble V such that 
{L,V) separates Avg-P from Avg-PBM and Smoothed-P from Smoothed-PBM ? Does there exist 
some L together with a computable ensemble T> that separates these classes? 

We conjecture that, assuming the exponential time hypothesis (ETH) [16], such an L G NP 
exists. Given the ETH, 3SAT requires time 2^("') in the worst case, thus also on average if we 
use the universal distribution. This holds even if we restrict 3SAT to 0(n) clauses. However, 
n is here the number of variables, not the bit length of the input, which is roughly 0(ralogra). 
Thus, a direct application of the ETH seems to be impossible here. 
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3 Disjoint Supports and Reducibility 



The same given input y can appear with very high and with very low probabihty at the same 
time. What sounds hke a contradiction has an easy explanation: Dn^x,4>iy) can be large whereas 
Dn,x',4>{y) for some other x' is small. But if we only see we do not know whether x or x' 
was perturbed. This causes some problems when one wants to develop a notion of reduction 
and completeness. 

For a parameterized distributional problem (L,!?), let 

Lds = {{x,y) \ y e L and |y| < poly(|x|)} 

The length of \y\ is bounded by the same polynomial that bounds the length of the strings in 
any snpp Dn^x,4>- We will interpret a pair {x,y) as "y was drawn according to -D„,a;,</,". 

With the notion of L^g, we can now define a reducibility between parameterized distributional 
problems. We stress that, although the definition below involves Ljg and L'^^, the reduction is 
defined for pairs L and L' and neither of the two is required to be a disjoint-support language. 

Definition 3.1. Let {L,!)) and {L',T>) be two parameterized distributional problems. {L,T>) 
reduces to (L',T>') (denoted by "{L,T>) <smoothcd (L',!)')") if there is a polynomial time com- 
putable function f such that for every n, every x, every <p and every y G suppZ)„^a;^0 the 
following holds: 

1. {x,y) G Lds if and only if f{{x,y);n,(l)) G L'^^. 

2. There exist polynomials p and m such that, for every n, x, and (f) and every y' G 

Dn,x,4>{y) ^ P(.'^)Dm{n)M{x,yy,n,<l>)Ay')^ 

y-f2{{x,y);n,<j>)=y' 

where f{{x,y);n,<^) = {fi{{x,y);n,(f)), f2{{x,y);n,(f))). 

Remark 3.2. We could also allow that (p on the right-hand side is polynomially transformed. 
However, we currently do not see how to benefit from this. 

It is easy to see that <smoothed is transitive. Ideally, Smoothed-P should be closed under 
this type of reductions. However, we can only show this for the related class of problems with 
disjoint support. 

Definition 3.3. Smoothed-Pds is the set of all distributional problems with disjoint supports 
such that there is an algorithm A for L^g with smoothed polynomial running time. (Here, the 
running time on {x,y) is defined in the same way as in Definition 2.1. Since \y\ < poly(|a;|) 
for a pair {x,y) G -L^s, it is o.k. to measure the running time in \x\.) 

Now, Smoothed-Pds is indeed closed under the above type of reductions. 

Theorem 3.4. If {L,V) <smoothed (^',^') and (L^s'^') ^ Smoothed-Pds, then {L^s^V) G 
Smoothed-Pds- 
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Proof. Let A' be a an errorless heuristic scheme for (L'^^,!)'). Let / be the reduction from 
{L,T>) to {V ,T)') and let p and ra be the corresponding polynomials. 

We claim that ?/); n, ^, 5) = A'{f{{x,y);n,(p);m{n),4>,6/p{n)) is an errorless heuristic 

scheme for {L^s,V). To prove this, let 

B = {y' e supp I A\{f{{x,y);n,(j)),y')]m{n),(p,6/p{n)) = ±} 

be the set of string on which A' fails. 

Because A' is a heuristic scheme, we have D'^^f^^) fi({x y)-n ip) <^(-^) — ^/pi''^)- Therefore, 

Pr {A{{x, y)-n, 4>, 5)=!.)= Pr {A'{f{{x, y);n, 4>y, m{n),^, 6/p{n) = ±) 

Dn,x,4>iy) 

y-f2((x,y)\n,4>)£B 
y'eB 

= Pin)D'm(ny,M{x,y};n,^y,<t>iB) < S. 

Thus, (Lds, V) G Smoothed-Pds- □ 

With the definition of disjoint support problems, a begging question is how the complexity of 
L and arc related. It is obvious that (L, V) G Smoothed-P implies (L^s, 25) £ Smoothed-Pds. 
However, the converse is not so obvious. The difference between L and Lds is that for Lds, we 
get the x from which the input y was drawn. While this extra information does not seem to 
be helpful at a first glance, we can potentially use it to extract randomness from it. So this 
question is closely related to the problem of derandomization. 

But there is an important subclass of problems in Smoothed-Pds whose counterparts are 
in Smoothed-P, namely those which have an oblivious algorithm with smoothed polynomial 
running time. We call an algorithm (or heuristic scheme) for some problem with disjoint 
supports oblivious if the running time on {x,y) does not depend on x (up to constant factors). 
Let Smoothed-Pds^ be the resulting subset of problems in Smoothed-Pds that have such an 
oblivious algorithm with smoothed polynomial running time. 

Theorem 3.5. For any parameterized problem {L,V), {L,V) G Smoothed-P if and only if 
{Lds,V) e Smoothed -P^^'. 

Proof. Let A be an oblivious algorithm with smoothed polynomial running time for Lds- Since 

A is oblivious, we get an algorithm for L with the same running time (up to constant factors) 
by running A on {xq, y) on input y, where a;o is an arbitrary string of length n. □ 

Note that almost all algorithms, for which a smoothed analysis has been carried out, do not 
know the x from which y was drawn; in particular, there is an oblivious algorithm for them. 

Question 3.6. Is there a problem {L,V) ^ Smoothed-P but (Lds, 2?) £ Smoothed-Pds? 

Note that in Lds, each y is paired with every x, so there is no possibility to encode infor- 
mation by omitting some pairs. This prohibits attempts for constructing such a problem like 
considering pairs (x,/(x)) where / is some one-way function. 

For the classes Smoothed-BPP or Smoothed-P/poly, which can be defined in the obvious 
way, knowing x docs not seem to help, since, given y, it should be possible to use the internal 
random bits (or the advice) to find an x' that is good enough. 
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4 Parameterized Distributional NP 



4.1 Dist-NPpara 

In this section, we define the smoothed analogue of the worst-case class NP and the average- 
case class DistNP [15,17]. First, we have to restrict ourself to "natural" distributions. This 
rules out, for instance, probability distributions based on Kolmogorov complexity that (the 
universal distribution), under which worst-case complexity equals average-case complexity for 
all problems [18]. We transfer the notion of computable ensembles to smoothed complexity. 

Definition 4.1. A parameterized family of distributions is in PCompp3j.a^ if the cumulative 
probability 

-^£'n,a;,0 ~ Dn,X,<f> 
Z<X 

is can he computed in polynomial time (given n, x and (j) in binary). 

With this notion, we can define the smoothed analogue of NP and DistNP. 
Definition 4.2. Dist-NPpara = {{L,V) andVe PCompp^ra}- 

4.2 Dist-NPpara-Complete Problems 

Bounded Halting. Having defined Dist-NPpara in the previous section, we now prove that 

bounded halting - given a Turing machine, an input, and a running-time bound, does the 
Turing machine halt on this input within the g ivcn time bound — is complete for Dist-NPpara* 
Bounded halting is the canonical N P-complcte language, and it has been the first problem that 
has been shown to be Avg-P-complete [17]. 
Formally, let 

BH = {{g, X, 1*) 1 NTM with Godel number g accepts x within t steps}. 

In order to show that BH is Dist-NPpara-complete, we need a "compression function" for 
probability distributions. 

Lemma 4.3. Let V = {Dn,x,4>) € PComppara an ensemble. There exists a deterministic 
algorithm C such that the following holds: 

1. C{y;n,x,(j)) runs in time polynomial in n and (j) for all y € supp-D„^a;,^- 

2. For every y, y' € supp Dn,x,4>, C{y; n, x, (p) = C{y'; n, x, (p) implies y = y' . 

3. IfDn^xAv) < 2~'^'; then |C(y; n, x, (^)| = l + |y|. Else, |C7(j/; n, x, </))| = log ^^^^ + c • 
log n + 1 for some constant c. 

Proof Consider any y £ supp{Dn,x,<i))- If Dn,x,<j,{y) < 2"'^', then let C{y;n, x, (j)) = Oy. If 
Dn,x,<j>iy) > 2~l^l, then let y' be the string that precedes y in lexicographic order, and let 
p = Fn^ x^{y')- Then we set C(y; n, x, </>) = la, where a is the longest common prefix of the 
binary representation of p and Fo^^^iy) =p + Dn,x,<t>{y)- Since V G PComppara, the string z 
can be computed in polynomial time. 

We have Dn,x,<t>{y) < 2~l"', since adding Dn,x,<t>iy) leaves the first |a| bits of p unchanged. 



9 



Let z be another string, z' its predecessor and h the longest common prefix oiq = Fd^ ^ ^i^') 
and q + -D„.x-.f/)(^')- The intervals [p,p+ Dn.x.(f,{y)) '''ii^ [q.q + Dn .j. (f,{y)) are disjoint by con- 
struction. Therefore, a and b have to be different, because otherwise these intervals would 
intersect. 

Let c be such that \y\ < rf for all y G supp(D„,j;,0). We set 

C{y;n,x,(p) = 1 bin(|a|)ao'°^ 

(Note that log — > \a\.) Here bin(|a|) is a fixed length binary encoding of a. We can 
bound this length by clog n. The total length of C{y; n, x, 0) is 

\C{y;n,x,(p)\ = 1 + clogn + log . 

Dn,xAyj 

It remains to be proved that C is injective. Let C{y;n,x,(f)) = C{z-,n,x,(l)). If C{y;n,x,(f)) 
starts with a 0, then obviously y = z. If C{y; n, x, (/)) starts with a 1, then the prefixes a and b 
are the same. Therefore y = z by the consideration above. □ 

The instances of BH are triples {g, x, 1*) of length 2 log \g\ + 2 log \x\ + \x\ + 151 + ^ + ©(1)- Let 

^c^ ■ 2-l^'l iig = g' and AT = \{g',x', and > log ^ 



otherwise. 



Above, is an appropriate scaling factor. More precisely, c,^ is the reciprocal of the number 
of possible lengths for a string x', i.e., it is of order tv-E^- particular, U^^^^ ^{y) < (f> 
for all y. 

Theorem 4.4. (BH, ?7^^) is Dist-NPpara-comp/eie under polynomial-time smoothed reductions. 

Proof. Let {L,V) G Dist-NPpara be arbitrary. Let p{n) be an upper bound for the length of 
the strings in any supp(I?„ a; ,^). Let M be a nondeterministic machine that accepts an input 
a if and only if there is a string y € L with C{y; n, x, </>) = a. Let q be an upper bound on the 
running time of M. Let g be the Godel number of M. Our reduction maps a string y to 

fi{x, y);n, x, 4>) = (^{g, C{x- n, x, cf)), 1*), {g, C{y; n, x, (f)), 1* 

where t and t' chosen in such a way that they are larger than q{p{\x\)). (And t and t' should 
be chosen in such a way that all tuples have the same length N{n).) 
By construction, {x,y) G Ljs if and only if f{{x,y);n,x,(/)) G BHds- 

Domination remains to be verified. Since C is injective, at most one y is mapped to {g, a, 1*) 
given n, x, and (p. We have 

f^'"(,,C(.;n,.,^),i*),^((5, C{y; n, x, 0), 1*')) = ■ 2-l^(^'^"'-'^)l. 

If \C{y;n,x,(j))\ < log D^l,^{y) + dogn + 1, then 

and domination is fulfilled. If |C(y; n, z, 0)| = 1 + |y|, then 

Uf,i^g,C[.;n,x,^),^A^9^C{y-,ri,xA), 1*')) < ' 2"l^l-^ < 2c^ ■ D^^^^y). 

This completes the hardness proof. The completeness follows since (BH, U^^) is indeed contained 
in Dist-NPpara- □ 
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Tiling. The original DistNP-complete problem by Levin [17] was Tiling (see also Wang [30]): 
An instance of the problem consists of a finite set T of square tiles, a positive integer t, and a 
sequence s = (si, . . . , s„) for some n <t such that Sj matches Sj+i (the right side of Si equals 
the left side of Sj+i). The question is whether S can be extended to tile an n x n square using 
tiles from T. We use the following probability distribution for Tiling: 



Here, a is the number of possible choices in T for each initial tile Sj. With this, we obtain the 
following theorem. 

Theorem 4.5. (Tiling, J7^^-'-^"S) is Dist-NPpara-comp/ete under <smoothed- 

Proof. By construction, wc have (Tiling, L'^'^^-'-^"^) G Dist-NPpara- For simplicity, we assume 
that the set T of tiles always contains two tiles encoding the input bits "0" and "1" and that 
these are the only possible tiles for the initial tiling (si, . . . , s„). (The problem does not become 
easier without this restriction, but the hardness proof becomes more technical.) 

For the hardness, (BH, ^7^'^) reduces to (Tiling, ^/'^^^^"S) because the Turing machine com- 
putations can be encoded as tiling problems in a straightforward way [30] (the Godel number 
g maps to some set T of tiles, and the input x maps to the initial tiling s). Finally, Item 2 of 
the reduction (Definition 3.1) is fulfilled can be seen to be satisfied because of the similarity 
between the two probability distributions. □ 

5 Basic Observations 

In this section, we collect some simple facts about Smoothed-P and Dist-NPpara and their re- 
lationship to their worst-case and average-case counterparts. First, Smoothed-P is sandwiched 
between P and Avg-P, which follows immediately from the definitions. 

Theorem 5.1. If L eP, then {L,V) G Smoothed-P for any V. If {L,V) G Smoothed-P with 

= iDn,x,(f>)n,x,<i), then (L, (I?n,x„,</.)n) £ Avg-P for (p = O {poly {n)/Nn,x) and every sequence 
{xn)n of strings with \xn\ < poly(n). 

Second, for unary languages, Avg-P and P coincide. The reason is that for unary languages, 
we have just one single instance 1" for each length n, and this instance has a probability of 1. 
Also Smoothed-P coincides with Avg-P and P for unary languages. Because the set of instances 
is just a singleton, the parameter cf) is fixed to 1 in this case. 

This observation allows us to transfer the result that DistNP C Avg-P implies NE = E [6] 
to smoothed complexity. (The latter classes are defined as NE = NTime(2^H) and E = 
DTime(2'^("^).) The transfer of their result to smoothed complexity is straightforward and 
therefore omitted. 

Theorem 5.2. // Dist-NPpara Q Smoothed-P, then NE = E. 

This gives some evidence that Dist-NPpara is not a subset of Smoothed-P. (We cannot have 
equality because Dist-NPpara is restricted to computable ensembles and problems in NP, while 
Smoothed-P does not have these restrictions.) 




\s'\ [iT = T' and N = \{T', s' , 1*')| and \T'\ > log 



otherwise. 
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6 Tractability 1: Integer Programming 



In this section, we will deal with tractable - in the sense of smoothed complexity - opti- 
mization problems: We will show that if a binary integer linear program can be solved in 
pseudo-polynomial time, then the corresponding decision problem belongs to Smoothed-P. 
This result is similar to Beier and Vockings characterization [5] : Binary optimization problems 
have smoothed polynomial complexity (with respect to continuous distributions) if and only if 
they can be solved in randomized pseudo-polynomial time. We follow their notation and refer 
to their lemmas wherever appropriate. 

Setup and Probabilistic Model. A binary optimization problem is an optimization prob- 
lem of the form 

maximize (F'x 

subject to wj X < ti for all i G [k] and 
xeSC {0,1}". 

Here, c^x = Yl^=i ^j^j linear objective function and wfx = Yl]=i ''^ij^j ^ linear 

constraints. Furthermore, we have the constraint that the binary vector x must be contained 
in the set S. This set S should be viewed as containing the "structure" of the problem. 
Examples are that S contains all binary vectors representing spanning trees in a graph of n 
vertices or that S represents all paths connecting two given vertices or that S contains all 
vectors corresponding to perfect matchings of a given graph. Maybe the simplest case is = 1 
and S = {0, l}"; then the binary program above represents the knapsack problem. 

We assume that S is adversarial (i.e., non-random). Since we deal with decision problems 
in this paper rather than with optimization problems, we use the standard approach and 
introduce a threshold for the objective function. This means that the optimization problem 
becomes the question whether there is an .x G 5 that fulfills c^x > b as well as wfx < ti for all 
i G {1, . . . , k}. In the following, we treat the budget constraint c^x > 6 as an additional linear 
constraint for simplicity. We call this type of problems binary decision problems. 

For ease of presentation, we assume that we have just one linear constraint (whose coefficients 
will be perturbed) and everything else is encoded in the set S. This means that the binary 
decision problem that we want to solve is the following: Does there exist an x G S with 
w'^x < ti 

The values tui, . . . , are n-bit binary numbers. Thus, u;^ G {0, 1, . . . , 2" — 1}. While we can 
of course vary their length, we choose to do it this way as it conveys all ideas while avoiding 

another parameter. 

Let us now describe the perturbation model. We do not make any assumption about the 
probability distribution of any single coefficient. Instead, our result holds for any family of 
probability distribution that fulfills the following properties: 

• wi,...,Wn are drawn according to independent distributions. The set S and the threshold 

2 

t are part of the input and not subject to randomness. Thus, Nn^(s,w,t) = 2" for any 
instance {S, w, t) of size n. We assume that S can be encoded by a polynomially long 
string. (This is fulfilled for most natural optimization problems, like TSP, matching, 
shortest path, or knapsack.) 

• The fact that wi, . . . ,Wn are drawn independently means that the probability for one 
coefficient to assume a specific value is bounded from above by (f)^/'^. 
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Since Nn^(s,w) = 2" , the perturbation parameter (f) can vary between 2""" (for the average 
case) and 1 (for the worst case). 

The idea is as follows: If we have a pseudo-polynomial algorithm, then we can solve instances 
with O(logn) bits per coefficient efficiently. Our goal is thus to show that O(logn) bits suffice 

2 

with high probability. (This is for the average case, i.e., ^ = 2~" . For larger cj), more bits are 
needed.) The proofs in the following are similar to proofs by Beier and Vocking [5]. However, 
at various places it gets slightly more technical because we have discrete rather than continuous 
probability distributions. 

Technical Lemmas and Main Theorem. The following simple lemma bounds the prob- 
ability that a certain coefficient assumes a value in a given small interval. 

Lemma 6.1. Let S,z € N. Let a be an n-bit coefficient drawn according to some discrete 
probability distribution bounded from above by <j)^^^. Then Pr(a G [z,z + 5)) < ^I'^b. 

Proof. There are exactly 8 outcomes of a that lead to a G [z, ^ + 5). Thus, Pr(a G \z,z^6)) < 
(j)^/"'e. □ 

Our goal is to show that 0(log(n(/>^/"2"')) bits for each coefficient suffice to determine whether 
a solution exists. (For the average case, we have cp = 2^" , thus O(logn) bits per coefficient.) 
To do this, it is not sufficient for an x G 5 to just satisfy w^x < t: Because of the rounding, 
we might find that x is feasible with respect to the rounded coefficients whereas x is infeasible 
with respect to the true coefficients. Thus, what we need is that w^x is sufficiently smaller 
than t. Then the rounding does not affect the feasibility. Unfortunately, we cannot rule out 
the existence of solutions x e S that are very close to the threshold (after all, there can 
be an exponential number of solutions, and it is likely that some of them are close to the 
threshold). But it is possible to prove the following: Assume that there is some ranking among 
the solutions x G S. Let the winner be the solution G 5 that fulfills w^x* < t and is 
ranked highest among all such solutions. Then it is likely that t — vFx* is not too small. Now, 
any solution that is ranked higher than x* must be infeasible because it violates the linear 
constraint uF x < t. Let x be the solution that minimizes uFx — t among all solutions ranked 
higher than x* . Then it is also unlikely that w^x — t extremely small, i.e., that x violates the 
linear constraint by only a small margin. 

Remark 6.2. In Beier and Vocking 's analysis [5], the ranking was given by the objective 
function. We do not have an objective function here because we deal with decision problems. 
Thus, we have to introduce a ranking artificially. In the following, we use the lexicographic 
ordering (if not mentioned otherwise), which satisfies the following monotonicity property that 
simplifies the proofs: if x ^ S is ranked higher than y € S, then there is an i with Xi = 1 and 

yi = o. 

Now let X* be the winner (if it exists), i.e., the highest ranked (with respect to lexicographic 
ordering) solution among all feasible solutions. Then we define the winner gap as 

^^^^ W"^x* if there exists a feasible solution and 

1 _L otherwise. 

The goal is to show that it is unlikely that F is small. In order to analyze F, it is useful to 
define also the loser gap A. The loser x G 5 is a solution that is ranked higher than x* but cut 
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off by the constraint tiFx < t. It is the solution with minimal uFx — t among all such solutions. 
(If there is a tie, which can happen because we have discrete probability distributions, then 
we take the highest-ranked solution as the loser.) We define 




w X — t if there exists a loser x and 
± otherwise. 



The probability that A or F is smaller than some value S is bounded by S(f)^/"n, which we will 
prove in the following. 

The following lemma states that it suffices to analyze A in order to get bounds for both A 
and r. In fact, for the setting with just one linear constraint with non-negative coefficients, we 
do not even need the winner gap. But the winner gap is needed for more general cases, which 
we discuss briefly below but do not treat in detail for conciseness. 

Lemma 6.3 (discrete version of [5, Lemma 7]). For all t and 6, we have Pr(r(t) < S) = 
Pr(A(t-5) <6). 

Proof. A solution x € S" is called Pareto- optimal if there is no other solution x' ^ S such that 
w'^x' < iiFx and x' is ranked higher than x. Let us make two observations. First, we observe 
that both winners and losers are Pareto-optimal. Second, for every Pareto-optimal solution 
x, there exists a threshold t such that x is the loser for this particular threshold. To see this, 
simply set t = tiF x — 1. 

Let P C 5 be the set of Pareto-optimal solutions. Then 

T(t) = min{t — w'^x | x G P, w^x < t} and 

A(t) = mm{w^x — t \ x E P, vFx > i) = TCi.ui{'uF x — t | x G P, nF x >t + l\. 

Now V{t) < 5 \i and only if there is an x G P with t — uFx G {0, . . . , 5 — 1}. This is equivalent 
to ui^x — t G {—5 -I- 1, . . . , 0} and to nFx — (t — 5) G {1, . . . , 5}. In turn, this is equivalent to 

A(t -5)<5. □ 

Now we analyze A(t). The following lemma makes this rigorous. It is a discrete counterpart 
to Beier and Vocking's separating lemma [5, Lemma 5]. We have to assume that the all-zero 
vector is not contained in S. The reason for this is that its feasibility does not depend on any 
randomness. 

Lemma 6.4 (separating lemma). Suppose that (0, ...,0) ^ S. For every (5, t G N, we have 
Pr(r(t) <5)< 54>^/''n and Vv{k{t) < 5) < dct^/'^n. 

If we use a non-monotone ranking, then the hounds for the probabilities become 5^^/"n^. 

Proof. Because of Lemma 6.3, it suffices to analyze the loser gap A. We only give a proof 
sketch for monotone rankings as that emphasis the differences to the continuous counterpart [5, 
Lemma 5]. 

Let Si = {x e S \ Xi = 1}, and let Si = S\Si = {x e S \ Xi = 0}. Let x** G Si be the winner 
from Sf. X** is ranked highest in Si and satisfies the linear constraint w'^x*'^ < t. Let x* G Si 
be the loser with respect to x*% i.e., a solution that is ranked higher than x** and minimizes 
w'^x'^ — t (if such a solution exists) . Let 




7i;'^.T* — t if X* exists and 
_L otherwise. 
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Note that X* can be feasible and, thus, Aj can be negative. 

To analyze Aj, we assume that all wj with j =^ i are fixed by an adversary. The winner .x** 
docs not depend Wi because all solutions x & Si have Xi = 0. Once x** is fixed, also x* is fixed. 
Because wj for j ^ iis fixed and xl = 1, we can rewrite uFx^ — t = z + Wi. Now Aj G {1, . . . , (5} 
if Wi assumes a value in some interval of length S, which happens with a probability of at most 

Furthermore, if A 7^ _L, then there exists an i with Aj = A [5, Claim B]. Thus, a union 
bound over all n possibilities for i yields Pr(A(f) < S) < 5(j?-^'^n. □ 

For the probabilistic constraint vF x < t, it is not sufficient for an x to satisfy it. Instead, 
we want that only a few bits of each coefficient of w suffice to find an x that satisfies that 
constraint. Here, "few" means roughly 0(log(n(?!)i/"2")). (Note that this is roughly O(logn) 
if we are close to the average case, where ~ 2~" .) Different from Beier and Vocking's 
continuous case (where the real- valued coefficients where revealed by an oracle), we have the 
true coefficients at hand. Thus, we do not need their certificates that a solution is indeed 
feasible, but we can simply test with the true coefficients. Clearly, this testing can be done in 
polynomial time. 

For an n-bit natural number a and 6 € N, let [ajb = be the number obtained from a by only 
taking the b most significant bits. This means that [a\b = 2""^ • [a/2"~^J. 

In order to show that pseudo-polynomiality implies smoothed polynomial complexity, we 
use a pseudo-polynomial algorithm as a black box in the following way: We run the pseudo- 
polynomial algorithm with the highest O(logn) bits. (To do this, we scale the rounded co- 
efficients of w down. Furthermore, we also have to scale t down appropriately.) If we find a 
solution, then we check it against the true coefficients of w. If it remains feasible, we output 
"yes". If it becomes unfeasible, then we take one more bit for each coefficient and continue. 
The following lemma gives a tail bound for how long this can go on. 

Lemma 6.5. Assume that we use b bits for each coefficient of w. Let x* be the winner (with 
respect to the true w without rounding). The probability that solving the problem with b bits 
for each coefficient yields a solution different from x* is bounded from above by 2^~^(f?-/'^n^ . 

Proof. We only get a solution different from if there is a solution x ranked higher than x* 
that is feasible with respect to the rounded coefficients. By rounding, we change each coefficient 
by at most 2"-^ Thus, w'^x - [wj^x < 2"-*n. 

We can conclude that we find x instead of x* only if the loser gap A is at most 2"'^'-'n, 
which happens with a probability of at most 2^~^(j)^/'^v? (or 2"'~*0i/"'n^ if the ranking is not 
monotone) . □ 

Theorem 6.6. // a binary decision problem can be solved in pseudo-polynomial time, then it 
is in Smoothed -P. 

Proof. We have to show that the running time of the algorithm sketched above, which uses 
the pseudo-polynomial algorithm as a black box, fulfills Theorem 2.3. 

If b bits for each coefficient are used, the running time of the pseudo-polynomial algorithm is 
bounded from above by 0((n2^)^) for some constant c. (Even the total running time summed 
over all iterations up to b bits being revealed is bounded by 0((n2'')'^), because it is dominated 
by the last iteration.) 
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The probability that more than time t = 0{{n2^Y) is needed is bounded from above by 
2n-b^i/n^2 according to Lemma 6.5. We can rewrite this as 

2 

The last inequality holds because > 2~" . The theorem is proved because this tail bound 
for the running time is strong enough according to Theorem 2.3. □ 



Examples and Discussion. Examples of problems in Smoothed-P are the decision problems 
associated with the following NP-hard optimization problems: 

• knapsack, where the goal is to find a subset of a given collection of items that maximizes 
the profit while obeying a budget for its weight; 

• constrained shortest path, where the goal is to find a path of minimum length that obeys 
a certain a budget; 

• constrained minimum-weight spanning tree. 

These problems can be solved in pseudo-polynomial time using dynamic programming, even 
if we insist on a lexicographically maximal solution (as we have to for Lemma 6.4). 

Let us now discuss some extensions of the model. We have restricted ourselves to deter- 
ministic pseudo-polynomial algorithms, which yield smoothed polynomial complexity. These 
deterministic algorithms can be replaced without any complication by randomized errorless 
algorithms that have expected pseudo-polynomial running time. 

So far, we have not explicitly dealt with constraints of the form "w-^x > t" . But they can 
be treated in the same way as "w^x < t", except that winner and loser gap change their 
roles. Furthermore, we did not include the case that coefficients can be positive or negative. 
This yields additional technical difficulties (we have to round more carefully and take both 
winner and loser gap into account), but we decided to restrict ourselves to the simpler form 
with non-negative coefficients for the sake of clarity. Moreover, we have not considered the 
case of multiple linear constraints [5, Section 2.3] for the same reason. Finally, Roglin and 
Vocking [23] have extended the smoothed analysis framework to integer programming. We 
believe that the same can be done for our discrete setting. 

The main open problem concerning Smoothed-P and integer optimization is the following: 
Beier and Vocking [5] have proved that (randomized) pscudo-polynomiality and smoothed 
polynomiality are equivalent. The reason why we do not get a similar result is as follows: Our 
"joint density" for all coefficients is bounded by 0, and the density of a single coefficient is 
bounded by 0^/". In contrast, in the continuous version, the joint density is bounded by 
while a single coefficient has a density bounded by (p. 

However, our goal is to devise a general theory for arbitrary decision problems. This theory 
should include integer optimization, but it should not be restricted to integer optimization. The 
problem is that generalizing the concept of one distribution bounded by for each coefficient 
to arbitrary problems involves knowledge about the instances and the structure of the specific 
problems. This knowledge, however, is not available if we want to speak about classes of 
decision problems as in classical complexity theory. 
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7 Tractability 2: Graph Problems 



In this section, we have a look at another example in the class Smoothed-P. We consider graph 
problems. The perturbation model that we choose is the smoothed extension ofGn,p [28]: Given 
an adversarial graph G = {V,E) and an e G (0, 1/2], we obtain a new graph G' = {V,E') on 
the same set of vertices by "flipping" each (non-)edge of G independently with a probability 
of £. This means the following: If e = {u, v} e E, then e is contained in E' with a probability 
of 1 - e. If e = {u, v} ^ E, then Pr(e G E') = e. 

Transferred to our framework, this means the following: We represent a graph G on n vertices 
as a binary string of length (2), and we have Nn^G = 2(2). The flip probability e depends on 

0: We choose e < 1/2 such that (1 — e)^^^ = (p. (For = 2~(2) = l/Nn^c, we have a fully 
random graph with edge probabilities of 1/2. For = 1, we have £ = 0, thus the worst case.) 

We will not present an exhaustive list of graph problems in Smoothed-P, but focus on graph 
coloring for a very simple example. /c-Coloring is the decision problem whether the vertices of 
a graph can be colored with k colors such that no pair of adjacent vertices get the same color. 
/c-Coloring is NP-complete for any k>3 [14, GT 4]. 

To show that A;-Coloring G Smoothed-P, we analyze the following simple algorithm: First, 
we check whether the input graph contains a clique of size k + l. This can be done easily in 
polynomial time. If yes, we output no. If no, we perform exhaustive search. 

Theorem 7.1. For any k eN, A; -Coloring G Smoothed-P. 

Proof. The algorithm for A;-Coloring is quite simple and the analysis is similar to Wilf's 
analysis [31] of the coloring problem: First, we check whether the input graph contains a 
clique of size A; -|- 1. This can be done easily in polynomial time. If yes, we output no. If no, 
we perform exhaustive search. The correctness of the algorithm is obvious. 

A graph is A;-colorable only if it does not contain a clique of size A; -|- 1. The probability that 
a specific set of A; -I- 1 vertices form a A; -|- 1 clique is at least ev 2 Thus, the probability that 

a graph G on n vertices does not contain a A; -|- 1 clique is at most 

n 

We distinguish two cases: First, e > 0.1. In this case, can be bounded from 

above by for some positive constant c < 1 that depends on k. Brute-force testing whether 
a graph can be A;-colored can be done in time poly(n) • A;". The probability that we need brute 
force is at most c". Thus, the expected running-time, raised to the power e = log;j(l/c), is 
bounded from above by a polynomial. 

Second, e < 0.1. Then we have ((> = {I - e)^^) > 0.9(2). The allowed running-time (raised 
to some constant power) is Nn,G(f>n = 2(2)^71 > 1.8(2). Thus, we can afford exhaustive search 
in every run. □ 

8 Discussion 

Our proposed framework has many of the characteristics that one would expect. We have 
reductions and complete problems and they work in the way one expects them to work. To 
define reductions, we have to use the concept of disjoint supports. It seems to be essential that 
we know the original instance x that the actual instance y was drawn from to obtain proper 
domination. Although this is somewhat unconventional, we believe that this is the right way 
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to define reductions in the smoothed setting. The reason is that otherwise, we do not know 
the probabihtics of tlic instances, which we need in order to apply tlic compression function. 
The compression function, in turn, seems to be crucial to prove hardness results. Still, an open 
question is whether a notion of reducibility can be defined that circumvents these problems. 

Many of the positive results from smoothed analysis can be cast in our framework, like it is 
done in Sections 6 and 7. 

• A large fraction of these positive results are originally formulated as continuous problems. 
However, we have to work in a discrete model, which makes things considerably more 

complicated. 

• Many positive results in the literature state their bounds in the number of "entities" (like 
number of nodes, number of coefficients) of the instance. However, in complexity theory, 
we measure bounds in the length (number of symbols) of the input in order to get a 
theory for arbitrary problems, not only for problems of a specific type. To state bounds 
in terms of bit length makes things less tight, for instance the reverse direction of integer 
programming does not work. But still, we think it is more important and useful to use 
the usual notion of input length such that smoothed complexity fits with average-case 
and worst-case complexity. 

Finally, the results by Roglin and Teng [22] show that, for binary optimization problems, 
expected polynomial is indeed a robust measure. We have shown that this is in general not the 
case. To do this, we have used a language in E. The obvious question is now whether Avg-P 
and Avg-PBM as well as Smoothed- P and Smoothed-PBM coincide for problems in NP. 

We hope that the present work will stimulate further research in smoothed complexity theory 
in order to get a deeper understanding of the theory behind smoothed analysis. 
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